Skip to main content

View Post [edit]

Poster: LucasMation Date: Apr 1, 2016 12:03pm
Forum: web Subject: Re: how to query for all the websites that end in '.com.br'?

OP here. I managed to make some progress using the wayback-cdx-server API. (https://github.com/internetarchive/wayback/tree/master/wayback-cdx-server)

The following are for *.com.br, and *.gov.br

http://web.archive.org/cdx/search/cdx?url=com.br/&;matchType=domain
http://web.archive.org/cdx/search/cdx?url=gov.br/&;matchType=domain

You can also limit the number of items returned:
http://web.archive.org/cdx/search/cdx?url=*.com.br/&;matchType=domain&limit=1000

Reply [edit]

Poster: pegzmasta Date: Apr 1, 2016 12:19pm
Forum: web Subject: Re: how to query for all the websites that end in '.com.br'?

Now, THIS is interesting!

I didn't think that you would have the patience to explore the API route. Extra kudos to you for researching this! The resource on GitHub is definitely worth listing. You can only do so much using the usual web interface that everyone is currently familiar with, but this API grants so much more control and functionality.

Reply [edit]

Poster: sahil7459 Date: May 25, 2017 5:00am
Forum: web Subject: Re: how to query for all the websites that end in '.com.br'?

does that url work for you, its not working at this end